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1 CAGCACGAGG AACTCCTTCT GATCACCTGG CCAGCTGAGG TCAGAGTGGG 

51 AGAGGCAGTG GTTCCATTGA AGGAGTACTC CTAACTGTCA GAAGCCTGGG 

101 CGGTCAGGAT GGGGTGCTGT CGCTTGGGCT GCGGGGGGTG TTCAGTTGCC 

151 CACAGTGTAT CTCAGGGTCT CACCAACCAT CCAAGCATGG TAGGCTGTGG 

201 CTGGCACCCA GGGTTGTGTG GCTGGGGAGG TGGTCTCCAC AGTTCCCTCC 

251 CTGCCCTCCC AGGGCCCCCA TCCATGCAGG TAACCATCGA GGATGTGCAG 

301 GCACAGACAG GCGGAACGGC CCAATTCGAG GCTATCATTG AGGGCGACCC 

351 ACAGCCCTCG GTGACCTGGT ACAAGGACAG CGTCCAGCTG GTGGACAGCA 

401 CCCGGCTTAG CCAGCAGCAA GAAGGCACCA CATACTCCCT GGTGCTGAGG 

451 CATGTGGCCT CGAAGGATGC CGGCGTTTAC ACCTGCCTGG CCCAAAACAC 

501 TGGTGGCCAG GTGCTCTGCA AGGCAGAGCT GCTGGTGCTT GGGGGGGACA 

551 ATGAGCCGGA CTCAGAGAAG CAAAGCCACC GGAGGAAGCT GCACTCCTTC 

601 TATGAGGTCA AGGAGGAGAT TGGAAGGGGC GTGTTTGGCT TCGTAAAAAG 

651 AGTGCAGCAC AAAGGAAACA AGATCTTGTG CGCTGCCAAG TTCATCCCCC 

701 TACGGAGCAG AACTCGGGCC CAGGCATACA GGGAGCGAGA CATCCTGGCC 

751 GCGCTGAGCC ACCCGCTGGT CACGGGGCTG CTGGACCAGT TTGAGACCCG 

801 CAAGACCCTC ATCCTCATCC TGGAGCTGTG CTCATCCGAG GAGCTGCTGG 

851 ACCGCCTGTA CAGGAAGGGC GTGGTGACGG AGGCCGAGGT CAAGGTCTAC 

901 ATCCAGCAGC TGGTGGAGGG GCTGCACTAC CTGCACAGCC ATGGCGTTCT 

951 CCACCTGGAC ATAAAGCCCT CTAACATCCT GATGGTGCAT CCTGCCCGGG 

1001 AAGACATTAA AATCTGCGAC TTTGGCTTTG CCCAGAACAT CACCCCAGCA 

1051 GAGCTGCAGT TCAGCCAGTA CGGCTCCCCT GAGTTCGTCT CCCCCGAGAT 

1101 CATCCAGCAG AACCCTGT6A GCGAAGCCTC CGACATTTGG GCCATGGGTG 

1151 TCATCTCCTA CCTCAGCCTG ACCTGCTCAT CCCCATTTGC CGGCGAGAGT 

1201 GACCGTGCCA CCCTCCTGAA CGTCCTGGAG GGGCGCGTGT CATGGAGCAG 

1251 CCCCATGGCT GCCCACCTCA GCGAAGACGC CAAAGACTTC ATCAAGGCTA 

1301 CGCTGCAGAG AGCCCCTCAG GCCCGGCCTA GTGCGGCCCA GTGCCTCTCC 

1351 CACCCCTGGT TCCTGAAATC CATGCCTGCG GAGGAGGCCC ACTTCATCAA 

1401 CACCAAGCAG CTCAAGTTCC TCCTGGCCCG AAGTCGCTGG CAGCGTTCCC 

1451 TGATGAGCTA CAAGTCCATC CTGGTGATGC GCTCCATCCC TGAGCTGCTG 

1501 CGGGGCCCAC CCGACAGCCC CTCCCTCGGC GTAGCCCGGC ACCTCTGCAG 

1551 GGACACTGGT GGCTCCTCCA GTTCCTCCTC CTCCTCTGAC AACGAGCTCG 

1601 CCCCATTTGC CCGGGCTAAG TCACTGCCAC CCTCCCCGGT GACACACTCA 

1651 CCACTGCTGC ACCCCCGGGG CTTCCTGCGG CCCTCGGCCA GCCTGCCTGA 

1701 GGAAGCCGAG GCCAGTGAGC GCTCCACCGA GGCCCCAGCT CCGCCTGCAT 

1751 CTCCCGAGGG TGCCGGGCCA CCGGCCGCCC AGGGCTGCGT GCCCCGGCAC 

1801 AGCGTCATCC GCAGCCTGTT CTACCACCAG GCGGGTGAGA GCCCTGAGCA 

1851 CGGGGCCCTG GCCCCGGGGA GCAGGCGGCA CCCGGCCCGG CGGCGGCACC 

1901 TGCTGAAGGG CGGCTACATT GCGGGGGCGC TGCCAGGCCT GCGCGAGCCA 

1951 CTGATGGAGC ACCGCGTGCT GGAGGAGGAG GCCGCCAGGG AGGAGCAGGC 

2001 CACCCTCCTG GCCAAAGCCC CCTCATTCGA GACTGCCCTC CGGCTGCCTG 

2051 CCTCTGGCAC CCACTTGGCC CCTGGCCACA GCCACTCCCT GGAACATGAC 

2101 TCTCCGAGCA CCCCCCGCCC CTCCTCGGAG GCCTGCGGTG AGGCACAGCG 

2151 ACTGCCTTCA GCCCCCTCCG GGGGGGCCCC TATCAGGGAC ATGGGGCACC 

2201 CTCAGGGCTC CAAGCAGCTT CCATCCACTG GTGGCCACCC AGGCACTGCT 

2251 CAGCCAGAGA GGCCATCCCC GGACAGCCCT TGGGGGCAGC CAGCCCCTTT 

2301 CTGCCACCCC AAGCAGGGTT CTGCCCCCCA GGAGGGCTGC AGCCCCCACC 

2351 CAGCAGTTGC CCCATGCCCT CCTGGCTCCT TCCCTCCAGG ATCTTGCAAA 

2401 GAGGCCCCCT TAGTACCCTC AAGCCCCTTC TTGGGACAGC CCCAGGCACC 

2451 CCCTGCCCCT GCCAAAGCAA GCCCCCCATT GGACTCTAAG ATGGGGCCTG 

2501 GAGACATCTC TCTTCCTGGG AGGCCAAAAC CCGGCCCCTG CAGTTCCCCA 

2551 GGGTCAGCCT CCCAGGCGAG CTCTTCCCAA GTGAGCTCCC TCAGGGTGGG 

2601 CTCCTCCCAG GTGGGCACAG AGCCTGGCCC CTCCCTGGAT GCGGAGGGCT 

2651 GGACCCAGGA GGCTGAGGAT CTGTCCGACT CCACACCCAC CTTGCAGCGG 

2701 CCTCAGGAAC AGGTGACCAT GCGCAAGTTC TCCCTGGGTG GTCGCGGGGG 

2751 CTACGCAGGC GTGGCTGGCT ATGGCACCTT TGCCTTTGGT GGAGATGCAG 

2801 GGGGCATGCT GGGGCAGGGG CCCATGTGGG CCAGGATAGC CTGGGCTGTG 

2851 TCCCAGTCGG AGGAGGAGGA GGAGGAGGAG GCCAGGGCTG AGTCCCAGTC 

2901 GGAGGAGGAG CAGGAGGCCA GGGCTGAGAG CCCACTGCCC CAGGTCAGTG 

2951 CAAGGCCTGT GCCTGAGGTC GGCAGGGCTC CCACCAGGAG CTCTCCAGAG 

3001 CCCACCCCAT GGGAGGACAT CGGGCAGGTC TCCCTGGTGC AGATCCGGGA 

3051 CCTGTCAGGT GATGCGGAGG CGGCCGACAC AATATCCCTG GACATTTCCG 

3101 AGGTGGACCC CGCCTACCTC AACCTCTCAG ACCTGTACGA TATCAAGTAC 

3151 CTCCCATTCG AGTTTATGAT CTTCAGGAAA GTCCCCAAGT CCGCTCAGCC 

3201 AGAGCCGCCC TCCCCCATGG CTGAGGAGGA GCTGGCCGAG TTCCCGGAGC 

3251 CCACGTGGCC CTGGCCAGGT GAACTGGGCC CCCACGCAGG CCTGGAGATC 

3301 ACAGAGGAGT CAGAGGATGT GGACGCGCTG CTGGCAGAGG CTGCCGTGGG 

3351 CAGGAAGCGC AAGTGGTCCT CGCCGTCACG CAGCCTCTTC CACTTCCCTG 

3401 GGAGGCACCT GCCGCTGGAT GAGCCTGCAG AGCTGGGGCT GCGTGAGAGA 

3451 GTGAAGGCCT CCGTGGAGCA CATCTCCCGG ATCCTGAAGG GCAGGCCGGA 
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3501 AGGTCTGGAG AAGGAGGGGC CCCCCAGGAA GAAGCCAGGC CTTGCTTCCT 
3551 TCCGGCTCTC AGGTCTGAAG AGCTGGGACC GAGCGCCGAC ATTCCTAAGG 
3601 GAGCTCTCAG ATGAGACTGT GGTCCTGGGC CAGTCAGTGA CACTGGCCTG 
3651 CCAGGTGTCA GCCCAGCCAG CTGCCCAGGC CACCTGGAGC AAAGACGGAG 
3701 CCCCCCTGGA GAGCAGCAGC CGTGTCCTCA TCTCTGCCAC CCTCAAGAAC 
3751 TTCCAGCTTC TGACCATCCT GGTGGTGGTG GCTGAGGACC TGGGTGTGTA 
3801 CACCTGGAGC GTGAGCAATG CGCTGGGGAC AGTGACCACC ACQGGCGTCC 
3851 TCCGGAAGGC AGAGCGCCCC TCATCTTCGC CATGCCCGGA TATCGGGGAG 
3901 GTGTACGCGG ATGGGGTGCT GCTGGTCTGG AAGCCCGTGG AATCCTACGG 
3951 CCCTGTGACC TACATTGTGC AGTGCAGCCT AGAAGGCGGC AGCTGGACCA 
4001 CACTGGCCTC CGACATCTTT GACTGCTGCT ACCTGACCAG CAAGCTCTCC 
4051 CGGGGTGGCA CCTACACCTT CCGCACGGCA TGTGTCAGCA AGGCAGGAAT 
4101 GGGTCCCTAC AGCAGCCCCT CGGAGCAAGT CCTCCTGGGA GGGCCCAGCC 
4151 ACCTGGCCTC TGAGGAGGAG AGCCAGGGGC GGTCAGCCCA ACCCCTGCCC 
4201 AGCACAAAGA CCTTCGCATT CCAGACACAG ATCCAGAGGG GCCGCTTCAG 
4251 CGTGGTGCGG CAATGCTGGG AGAAGGCCAG CGGGCGGGCG CTGGCCGCCA 
4301 AGATCATCCC CTACCACCCC AAGGACAAGA CAGCAGTGCT GCGCGAATAC 
4351 GAGGCCCTCA AGGGCCTGCG CCACCCGCAC CTGGCCCAGC TGCACGCAGC 
44 01 CTACCTCAGC CCCCGGCACC TGGTGCTCAT CTTGGAGCTG TGCTCTGGGC 
4451 CCGAGCTGCT CCCCTGCCTG GCCGAGAGGG CCTCCTACTC AGAATCTGAG 
4501 GTGAAGGACT ACCTGTGGCA GATGTTGAGT GCCACCCAGT ACCTGCACAA 
4551 CCAGCACATC CTGCACCTGG ACCTGAGGTC CGAGAACATG ATCATCACCG 
4601 AATACAACCT GCTCAAGGTC GTGGACCTGG GCAATGCACA GAGCCTCAGC 
4651 CAGGAGAAGG TGCTGCCCTC AGACAAGTTC AAGGACTACC TAGAGACCAT 
4701 GGCTCCAGAG CTCCTGGAGG GCCAGGGGGC TGTTCCACAG ACAGACATCT 
4751 GGGCCATCGG TGTGACAGCC TTCATCATGC TGAGCGCCGA GTACCCGGTG 
4801 AGCAGCGAGG GTGCACGCGA CCTGCAGAGA GGACTGCGCA AGGGGCTGGT 
4851 CCGGCTGAGC CGCTGCTACG CGGGGCTGTC CGGGGGCGCC GTGGCCTTCC 
4901 TGCGCAGCAC TCTGTGCGCC CAGCCCTGGG GCCGGCCCTG CGCGTCCAGC 
4951 TGCCTGCAGT GCCCGTGGCT AACAGAGGAG GGCCCGGCCT GTTCGCGGCC 
5001 CGCGCCCGTG ACCTTCCCTA CCGCGCGGCT GCGCGTCTTC GTGCGCAATC 
5051 GCGAGAAGAG ACGCGCGCTG CTGTACAAGA GGCACAACCT GGCCCAGGTG 
5101 CGCTGAGGGT CGCCCCGGCC ACACCCTTGG TCTCCCCGCT GGGGGTCGCT 
5151 GCAGACGCGC CAATAAAAAC GCACAGCCGG GCGAGAAAAA AAAAAAAAAA 
5201 AAAAAAA (SEQ ID N0:1) 

FEATXJRES : 

Start: 109 

Stop: 5104 

Homologous proteins: 
Top BLAST Hits: 



gi 

gi 

gi| 

gi| 

gi| 

gi| 

gi| 

gi| 

gi| 

gi| 

gi| 

gi| 

gi| 



7242949 dbj | BAA92535 . 1 | (AB037718) KIAA1297 protein [Homo sa. 
8928460 sp|075962|TRIO_HUMAN TRIPLE FUNCTIONAL DOMAIN PROTEI, 
6005922 ref |NP_009049.l| triple functional domain (PTPRF int. 
3024081 sp|Q15746|KMLS_HUMAN MYOSIN LIGHT CHAIN KINASE, SMOO. 
90103 |pir I j A41674 myosin -light -chain kinase (EC 2.7.1.117), . 
7239696 |gb|AAC18423.2 I (U48959) myosin light chain kinase [H. 
7239698 |gb|AAD15921.2 I (AF069601) myosin light chain kinase . 
1103677|emb|CAA62378.l| (X90870) myosin- light -chain kinase [. 



3024085 I sp 
2851405 I sp 
3982821 jgb 
3982823 jgb 
3982827 |gb 
3982807 jgb 



Q28824|KMLS_BOVIN MYOSIN LIGHT CHAIN KINASE, SMOO. 
P29294|KMLS_RABIT MYOSIN LIGHT CHAIN KINASE, SMOO. 
AAC83683.1 (AF081663) myosin light chain kinase . 
AAC83684.1 (AF081664) myosin light chain kinase . 
AAC83686,1 (AF081666) myosin light chain kinase . 
AAC83676.1 (AF081656) myosin light chain kinase . 



BLAST dbEST hit; 

gi 1 7958129 /dataset=tdbest /taxon=960 . . . 
EXPRESSION INFORMATION FOR MODULATORY USE: 

From BLAST dbEST hit: 

gi 1 7958129 Human Colon carcinoma 

From PCR-based tissue screening panels: 

Human Placenta 

Human Kidney 

Human Lung 

Human skeletal muscle 
Human heart 

Human fetal whole brain 
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1 MGCCRLGCGG CSVAHSVSQG LTNHPSMVGC GWHPGLCGWG GGLHSSLPAL 
51 PGPPSMQVTI EDVQAQTGGT AQFEAIIEGD PQPSVTWYKD SVQLVDSTRL 
101 SQQQEGTTYS LVLRHVASKD AGVYTCLAQN TGGQVLCKAE LLVLGGDNEP 
151 DSEKQSHRRK LHSFYEVKEE IGRGVFGFVK RVQHKGNKIL CAAKFIPLRS 
201 RTRAQAYRER DILAALSHPL VTGLLDQFET RKTLILILEL CSSEELLDRL 
251 YRKGyVTEAE VIO/YIQQLVE GLHYLHSHGV LHLDIKPSNI LMVHPAREDI 
301 KICDFGFAQN ITPAELQFSQ YGSPEFVSPE IIQQNPVSEA SDIWAMGVIS 
351 YLSLTCSSPF AGESDRATLL NVLEGRVSWS SPMAAHLSED AKDFIKATLQ 
401 RAPQARPSAA QCLSHPWFLK SMPAEEAHFI NTKQLKFLLA RSRWQRSLMS 
451 YKSILVMRSI PELLRGPPDS PSLGVARHLC RDTGGSSSSS SSSDNELAPF 
501 ARAKS LPPSP VTHSPLIiHPR GFLRPSASLP EEAEASERST EAPAPPASPE 
551 GAGPPAAQGC VPRHSVIRSL FYHQAGESPE HGALAPGSRR HPARRRHLLK 
601 GGYIAGALPG LREPLMEHRV LEEEAAREEQ ATLLAKAPSF ETALRLPASG 
651 THIiAPGHSHS LEHDSPSTPR PSSEACGEAQ RLPSAPSGGA PIRDMGHPQG 
701 SKQLPSTGGH PGTAQPERPS PDSPWGQPAP FCHPKQGSAP QEGCSPHPAV 
751 APCPPGSFPP GSCKEAPLVP SSPFLGQPQA PPAPAKASPP LDSKMGPGDI 
801 SLPGRPKPGP CSSPGSASQA SSSQVSSLRV GSSQVGTEPG PSLDAEGWTQ 
851 EAEDLSDSTP TLQRPQEQVT MRKFSLGGRG GYAGVAGYGT FAFGGDAGGM 
901 LGQGPMWARI AWAVSQSEEE EQEEARAESQ SEEQQEARAE SPLPQVSARP 
951 VPEVGRAPTR SSPEPTPWED IGQVSLVQIR DLSGDAEAAD TISLDISEVD 
1001 PAYLNLSDLY DIKYLPFEFM IFRKVPKSAQ PEPPSPMAEE ELAEFPEPTW 
1051 PWPGELGPHA GLEITEESED VDALLAEAAV GRKRKWSSPS RSLFHFPGRH 
1101 LPLDEPAELG LRERVKASVE HISRILKGRP EGLEKEGPPR KKPGLASFRL 
1151 SGLKSWDRAP TFLRELSDET WLGQSVTLA CQVSAQPAAQ ATWSKDGAPL 
1201 ESSSRVLISA TLKNFQLLTI LVWAEDLGV YTCSVSNALG TVTTTGVLRK 
1251 AERPSSSPCP DIGEVYADGV LLVWKPVESY GPVTYIVQCS LEGGSWTTLA 
1301 SDIFDCCYLT SKLSRGGTYT FRTACVSKAG MGPYSSPSEQ VLLGGPSHLA 
1351 SEEESQGRSA QPLPSTKTFA FQTQIQRGRF SWRQCWEKA SGRALAAKII 
1401 PYHPKDKTAV LREYEALKGL RHPHLAQLHA AYLSPRHLVL ILELCSGPEL 
1451 LPCLAERASY SESEVKDYLW QMLSATQYLH NQHILHLDLR SENMIITEYN 
1501 LLKWDLGNA QSLSQEKVLP SDKFKDYLET MAPELLEGQG AVPQTDIWAI 
1551 GVTAFIMLSA EYPVSSEGAR DLQRGLRKGL VRLSRCYAGL SGGAVAFLRS 
1601 TLCAQPWGRP CASSCLQCPW LTEEGPACSR PAPVTFPTAR LRVFVRNREK 
1651 RRALLYKRHN LAQVR (SEQ ID NO: 2) 

FEATtJRES : 

Functional domains and key regions: 
Proslte results: 

[1] PDOCOOOOl PSOOOOl ASN_GLYCOSYIiATION 

N-glycosylation site 

1005-1008 NLSD 



[2] PDOC00004 PS00004 CAMP_PHOSPHO_SITE 

cAMP- and cGMP- dependent protein kinase phosphorylation site 

Niimber o£ matches: 2 

1 872-875 RKFS 

2 1084-1087 RKWS 



[31 PDOC00005 PS00005 PKC_PHOSPHO_SITE 
Protein kinase C phosphorylation site 
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[4] PDOC00006 PS00006 CK2_PH0SPH0_SITE 
Casein kinase II phosphorylation site 
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[5] PDOC00008 PSOQ008 MYRISTYL 
N-myristoylation site 
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[6] PDOC00009 PS00009 ANZDATION 
Amidation site 



1080-1083 VGRK 



[7] PDOC00373 PS00343 GRAM_POS_ANCHORING 

Gram-positive cocci surface proteins 'anchoring' hexapeptide 



704-709 LPSTGG 



[8] PDOCOOlOO PS00107 PROTEIN_KINASB_ATP 
Protein kinases ATP -binding region signature 

171-194 IGRGVFGFVKRVQHKGNKILCAAK 



[9] PDOCOOlOO PS00108 PROTEIN_KINASE_ST 
Serine/Threonine protein kinases active-site signature 



280-292 VLHLDIKPSNILM 



[10] PDOCOOlOO PS00109 PROTEIN_KINASE_TYR 

Tyrosine protein kinases specific active-site signature 

1484-1496 ILHLDLRSENMII 



[11] PDOC00565 PS00659 GLYC0SYL_HYDR0L_F5 
Glycosyl hydrolases family 5 signature 



142-151 LVLGGDNEPD 



BLAST Alignment to Top Hits: 

>gi|7242949|dbj |BAA92535.1| (AB037718) KIAA1297 protein [Homo sapiens] 
Length = 2242 

Score = 425 bits (1081) , Expect = e-117 

Identities = 305/876 (34%), Positives = 423/876 (47%), Gaps = 106/876 (12%) 

PSMQVTIEDVQAQTGGTAQFEAIIEGDPQPSVTWYKDSVQLVDSTRLSQQQEGTTYSLVL 113 
P + +EDV+ G TA+F ++EG P P + WYKD V L +S+ +S E SLV+ 
PRFESIMEDVEVGAGETARFAVWEGKPLPDIMWYKDEVLLTESSHVSFVYEENECSLW 563 

RHVASKDAGVYTCLAQNTGGQVLCKAELLVLGGDN EPDSEKQSHR-RKLHSFYEVK 16 8 

++D GVYTC AQN G+V CKAEL V E E + HR R+L FY++ 

LSTGAQDGGVYTCTAQNLAGEVSCKAELAVHSAQTAMEVEGVGEDEDHRGRRLSDFYDIH 623 

EEIGRGVFGFVKRVQHKGNKILCAAKFIPLRSRTRAQAYRERDILAALSHPLVTGLLDQF 228 
+EIGRG F +++R+ + + + AAKFIP +++ +A A RE +LA L H V + F 
QEIGRGAFSYLRRIVERSSGLEFAAKFIPSQAKPKASARREARLLARLQHDCVLYFHEAF 683 

ETRKTLILILELCSSEELLDRLYRKGWTEAEVKVYIQQLVEGLHYLHSHGVLHLDIKPS 288 
E R+ L+++ ELC+ EELL+R+ RK V E+E++ Y++Q++EG+HYLH VLHLD+KP 
ERRRGLVIVTELCT-EELLERIARKPTVCESEIRAYMRQVLEGIHYLHQSHVLHLDVKPE 742 

NILMVHPA- -REDIKICDFGFAQNITPAELQFSQYGSPEFVSPEIIQQNPVSEASDIWAM 346 
N+L+ A + ++ICDFG AQ +TP E Q+ QYG+PEFV+PEI+ Q+PVS +DIW + 
NLLVWDGAAGEQQVRICDFGNAQELTPGEPQYCQYGTPEFVAPEIVNQSPVSGVTDIWPV 8 02 

GVISYLSLTCSSPFAGESDRATLLNVLEGRVSWSSPMAAHLSEDAKDF- IKATLQRAPQA 4 05 
GV+++L LT SPF GE+DR TL+N+ V++ LS +A+ F IK +Q + 

GWAFLCLTGISPFVGENDRTTLMNIRNYNVAFEETTFLSLSREARGFLIKVLVQ- -DRL 860 

RPSAAQCLSHPWFLKSMPAEEAHFINTKQLKFLLARSRWQRSLMSYKSILVMRSIPELLR 465 
RP+A + L HPWF E ++T LK L+R RWQRS +SYK LV+R IPELLR 
RPTAEETLEHPWFKTQAKGAE VSTDHLKLFLSRRRWQRSQI SYKCHLVLRP I PELLR 917 

GPPDSPSLGVARHLCRDTGGSSSSSSSSDNELAPFARAK SLPPSPVTH 513 

PP+ + + R +GG SSSS S + EL SL P 

APPERVWVTMPRR-PPPSGGLSSSSDSEEEELEELPSVPRPLQPEFSGSRVSLTDIPTED 976 

SPLLHPRGFLRPSASLPEEAEASERSTEAPAPPASPEGAGPPT^QGCVPRHSVI 567 
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LP E+ A + EAP+P A P PAA G PR + 

Sbjct: 977 EALGTPETGAATPMDWQEQGRAPSQDQEAPSPEALPSPGQEPAA-GASPRRGELRRGSSA 1035 

Query: 568 RSLFYHQAGESPEHGALAPG SRRHPARRRHLLK 600 

R L + E P+ + PG ++R A R+ LL+ 

Sbjct: 1036 ESALPRAGPRELGRGLHKAASVELPQRRSPGPGATRLARGGLGEGEYAQRLQALRQRLLR 1095 

Query: 601 GGYIAGALPGLREPLMEH RVLEEEAAREEQATL LAKAPSFETALR 645 

GG G + GLR PL+E R EAA Q L L K+ SF 

Sbjct: 1096 GGPEDGKVSGLRGPLLESLGGRARDPRMARAASSEAAPHHQPPLENRGLQKSSSFSQGEA 1155 

Query: 646 LPASGTHLAPGHSHSLEHDSPSTPR PSSEACGEAQRLPSAPSGGAPIRDMGHPQGS 701 

PGHG + R PS A EAQ PS+P+ P 
Sbjct: 1156 EP-RGRHRRAGAPLEIPVARIiGARRLQESPSLSALSEAQ--PSSPA RPSAP 1203 

Query: 702 KQLPSTGGHPGTAQPERPSPDSPWGQPAPFCHPKQGSAPQEGCSPHPAVAPCPP GS 757 

K PST P +A+P +P PAP P Q AP+ P A P PP + 

Sbjct: 1204 K--PST---PKSAEPSATTPSDAPQPPAP--QPAQDKAPEPRPEPVRASKPAPPPQALQT 1256 

Query: 758 FPPGSCKEAPLVPSSPFLGQPQAPPAPAKASPPLDSKMGPGDISLPGRPKPGPCSSPGSA 817 

A ++ S G Q P+ A+PP + K + P PG + 

Sbjct: 1257 LALPLTPYAQIIQSLQLSGHAQG-PSQGPAAPPSEPKPHAAVFARVASPPPG--APEKRV 1313 

Query: 818 SQASSSQVSSLRVGSSQVGTEPGPSLDAEGWTQEAE 853 

A V + + V PG SL + E+E 

Sbjct: 1314 PSAGGPPVLAEKARVPTVPPRPGSSLSSSIENLESE 1349 (SEQ ID N0:4) 



Score = 210 bits (529) , Expect = le-52 

Identities = 111/281 (39%), Positives = 156/281 (55%), Gaps = 2/281 (0%) 

Query: 1336 SPSEQVLLGGPSHLASEEESQGRSAQPLPSTKTFAFQTQIQRGRFSWRQCWEKASGRAL 1395 

SP+++V+ S S +G + +PK + F + RGRF WR C E A+GR 
Sbjct: 1952 SPAKEWSSPGSSPRSSPRPEGTTLRQGPPQKPYTFLEEKARGRFGWRACRENATGRTF 2011 

Query: 1396 AAKIIPYHPKDKTAVLREYEALKGLRHPHLAQLHAAYLSPRHLVLILELCSGPELLPCLA 1455 

AKI+PY + K VL+EYE L+ L H + LH AY++PR+LVLI E C ELL L+ 
Sbjct: 2012 VAKIVPYAAEGKPRVLQEYEVLRTLHHERIMSLHEAYITPRYLVLIAESCGNRELLCGLS 2071 

Query: 1456 ERASYSESEVKDYLWQMLSATQYLHNQHILHLDLRSENMIITEYNLLKWDLGNAQSLSQ 1515 

+R YSE +V Y+ Q+L YLH H+LHLD++ +N+++ N LK+VD G+AQ + 
Sbjct: 2072 DRFRYSEDDVATYMVQLLQGLDYLHGHHVLHLDIKPDNLLLAPDNALKIVDFGSAQPYNP 2131 

Query: 1516 EKVLPSDKFKDYLETMAPELLEGQGAVPQTDIWAIGVTAFIMLSAEYPVSSEGARDLQRG 1575 

+ + p LE MAPE+++G+ TDIW GV +IMLS P ++ + 

Sbjct: 2132 QALRPLGHRTGTLEFMAPEMVKGEPIGSATDIWGAGVLTYIMLSGRSPFYEPDPQETEAR 2191 

Query: 1576 LRKGLVRLSRCYAGLSGGAVAFLRSTLCAQPWGRPCASSCL 1616 

+ G + Y S A FLR L PW RP SSCL 

Sbjct: 2192 IVGGRFDAFQLYPNTSQSATLFLRKVLSVHPWSRP--SSCL 2230 (SEQ ID NO: 5) 



Score = 170 bits (426), Expect = le-40 

Identities = 168/574 (29%) , Positives = 256/574 (44%) , Gaps = 42/574 (7%) 

Query: 1103 LDEP--AELGLRERVKASVEHISRILKGRPEGLEKEGPPRKKPGLASFRLSGLKSWDRAP 1160 

L EP A GLR+ V+HI R+L + KPP+ L Ii ++AP 
Sbjct: 358 LREPGWAATGLRK GVQHIFRVLSTTVKSSSKPSPPSEPVQL LEHGPTLEEAP 409 

Query: 1161 TFLRELSDETWLGQSVTLACQVSAQPAAQATW-SKDGAPLESSSRVL-ISATLKNFQLL 1218 

L + W GQ ++ + AQ W S GA LE+ + V +S + L 

Sbjct: 410 AMLDKPDIVYWEGQPASVTVTFN-HVEAQWWRSCRGALLEARAGVYELSQPDDDQYCL 468 

Query: 1219 TILWVAEDLGVYTCSVSNALGTVTTTGVLRKAERPS-SSPCPDI GEVYADGVLLV 1273 

I V D+G TC+ N GT T + L AE P S D+ GE V++ 
Sbjct: 469 RICRVSRRDMGALTCTARNRHGTQTCSVTLELAEAPRFESIMEDVEVGAGETARFAVWE 528 

Query: 1274 WKPVESYGPVTYIVQCSLEGGSWTTLASDIFDCCY- -LTSKLSRGGTYTFRTACVSKAGM 1331 

KP+ +Y + L S + ++C L++ GG YT C++ 
Sbjct: 529 GKPLPDI--MWYKDEVLLTESSHVSFVYEENECSLVVLSTGAQDGGVYT CTAQNLA 582 



FIGURE 2D 



Docket No.: CL000927.CIP-DIV2 
Serial No.: To be assigned 
Inventors: Ming-Hui WEI et al. 
Title: ISOLATED HUMAN KINASE PROTEINS... 

Query: 1332 GPYSSPSEQVLLGGPSHLASEEESQGRSAQPLPSTKTFAFQTQIQRGRFSWRQCWEKAS 1391 

G S +E + + + E + + + + +1 RG FS +R+ E++S 
Sbjct: 583 GEVSCKAELAVHSAQTAMEVEGVGEDEDHRGRRLSDFYDIHQEIGRGAFSYLRRIVERSS 642 

Query: 1392 GRALAAKIIPYHPKDKTAVLREYEALKGLRHPHLAQLHAAYLSPRHLVLILELCSGPELL 1451 

G AAK IP K K + RE L L+H + H A+ R LV++ ELC+ ELL 
Sbjct: 643 GLEFAAKFIPSQAKPKASARREARLLARLQHDCVLYFHEAFERRRGLVIVTELCT-EELL 701 

Query: 1452 PCLAERASYSESEVKDYLWQMLSATQYLHNQHILHLDLRSENMIITE YNLLKWDL 1507 

+A + + ESE++ Y+ Q+L YLH H+LHLD++ EN+++ + +++ D 

Sbjct: 702 ERIARKPTVCESEIRAYMRQVLEGIHYLHQSHVLHLDVKPENLLVWDGAAGEQQVRICDF 761 

Query: 1508 GNAQSLSQEKVLPSDKFKDYLETMAPELLEGQGAVPQTDIWAIGVTAFIMLSAEYPVSSE 1567 

GNAQ L+ + P E +APE++ TDIW +GV AF+ L+ P E 

Sbjct: 762 GNAQELTPGE--PQYCQYGTPEFVAPEIVNQSPVSGVTDIWPVGWAFLCLTGISPFVGE 819 

Query: 1568 GARDLQRGLRKGLVRLSR-CYAGLSGGAVAFLRSTLCAQPWGRPCASSCLQCPWLTEEGP 1626 

R +R V + LS A FL L Q RP A L+ PW + 

Sbjct: 820 NDRTTLMNIRNYNVAFEETTFLSLSREARGFLIKVL-VQDRLRPTAEETLEHPWFKTQ-- 876 

Query: 1627 ACSRPAPVTFPTARLRVFV-RNREKRRALLYKRH 1659 

++ A V+ T L++F+ R R +R + YK H 
Sbjct: 877 --AKGAEVS--TDHLKLFLSRRRWQRSQISYKCH 906 (SEQ ID N0:6) 



Score = 145 bits (362) , Expect = 4e-33 

Identities - 85/253 (33%), Positives = 135/253 (52%), Gaps = 5/253 (1%) 

Query : 165 YEVKEE IGRGVFGFVKRVQHKGNKI LCAAKFI PLRSRTRAQAYRERDILAALSHPLVTGL 224 

Y EE RG FG V+ + AK +P + + + +E ++L L H + L 

Sbjct: 1985 YTFLEEKARGRFGWRACRENATGRTFVAKIVPYAAEGKPRVLQEYEVLRTLHHERIMSL 2044 

Query: 225 LDQFETRKTLILILELCSSEELLDRLYRKGWTEAEVKVYIQQLVEGLHYLHSHGVLHLD 284 

+ + T + L+LI E C + ELL L + +E +V Y+ QL++GL YLH H VLHLD 
Sbjct: 2045 HEAYITPRYLVLIAESCGNRELLCGLSDRFRYSEDDVATYMVQLLQGLDYLHGHHVLHLD 2104 

Query: 285 IKPSNILMVHPAREDIKICDFGFAQNITPAELQ--FSQYGSPEFVSPEIIQQNPVSEASD 342 

IKP N+L+ +KI DFG AQ P L+ + G+ EF++PE+++ P+ A+D 

Sbjct: 2105 IKPDNLLLA--PDNALKIVDFGSAQPYNPQALRPLGHRTGTLEFMAPEMVKGEPIGSATD 2162 

Query: 343 IWAMGVISYLSLTCSSPFAGESDRATLLNVLEGRVSWSSPMAAHLSEDAKDFIKATLQRA 402 

IW GV++Y+ L+ SPF + T ++ GR + + + S+ A F++ L 

Sbjct: 2163 IWGAGVLTYIMLSGRSPFYEPDPQETEARIVGGRFD-AFQLYPNTSQSATLFLRKVLSVH 2221 

Query: 403 PQARPSAAQCLSH 415 

P +RPS+ + H 
Sbjct: 2222 PWSRPSSCLSVCH 2234 (SEQ ID NO: 7) 



Score = 128 bits (319) , Expect = 4e-28 

Identities = 81/245 (33%), Positives = 120/245 (48%), Gaps = 19/245 (7%) 

Query: 1139 PRKKPGLASFRLSGL -KSWDRAPTFLRELSDETWLGQSVTLACQVSAQP 1187 

PRK GL+ LS D P F +L D+ ++ G++ TL C +A P 

Sbjct: 1571 PRKDKGLSPPNLSASVQEELGHQYVRSESDFPPVFHIKLKDQVLLEGEAATLLCLPAACP 1630 

Query: 1188 AAQATWSKDGAPLESSSRVLISATLKNFQLLTILWVAEDLGVYTCSVSNALGTVTTTGV 1247 

A +W KD L S V+I + QLL+I G+Y CS +N LG++T++ 

Sbjct: 1631 APHISWMKDKKSLRSEPSVIIVSCKDGRQLLSIPRAGKRHAGLYECSATNVLGSITSSCT 1690 

Query: 1248 LRKAERPSSSPCPDIGEVYADGVLLVWKPVESYGPVTYIVQCSLEGGS-WTTLASDIFDC 1306 

+ A P P++ + Y D L++WKP +S P TY ++ ++G S W ++S I DC 

Sbjct: 1691 VAVARVPGKLAPPEVTQTYQDTALVLWKPGDSRAPCTYTLERRVDGESVWHPVSSGIPDC 1750 

Query: 1307 CYLTSKLSRGGTYTFRTACVSKAGMGPYSSPSEQVLLGG PSHLASEEESQGRS 1359 

Y + L G T FR AC ++AG GP+S+ SE+V + G PS E R 

Sbjct: 1751 YYNVTHLPVGVTVRFRVACANRAGQGPFSNSSEKVFVRGTQDSSAVPSAAHQEAPVTSRP 1810 

Query: 1360 AQPLP 1364 



FIGURE 2E 



Docket No.: CL000927-CIP-DIV2 
Serial No.: To be assigned 
Inventors: Ming-HulWEI tal. 
Title: ISOI-ATED HUMAN KINASE PROTEINS... 

A+ P 

Sbjct: 1811 ARARP 1815 (SEQ ID NO: 8) 



Score = 71.0 bits (171), Expect = 9e-ll 

Identities = 41/115 (35%) , Positives = 57/115 (48%) , Gaps = 4/115 (3%) 

Query: GO lEDVQAQTGGTAQFEAIIEGDPQPSVTWYKDSVQLVDSTRLSQQQEGTTYSLVLRHVASK 119 

+EDV+ G A+F+ I G P P VTW + +S L +Q+G +SL + HV S+ 

Sbjct: 89 LEDVEVLEGRAARFDCKISGTPPPWTWTHFGCPMEESENLRLRQDGGLHSLHIAHVGSE 148 

Query: 120 DAGVYTCLAQNTGGQVLCKAELLVLGGDNEPDSEKQSHRRKLHSFYEVKEEIGRG 174 

D G+Y A NT GQ C A+L V EP + KL + EE +G 

Sbjct: 149 DEGLYAVSAVNTHGQAHCSAQLYV EEPRTAASGPSSKLEKMPSIPEEPEQG 199 (SEQ ID NO: 9) 



Score = 60.1 bits (143), Expect = 2e-07 

Identities = 54/199 (27%), Positives ^ 81/199 (40%), Gaps = 12/199 (6%) 

Query: 1160 PTFLRELSDETWLGQSVTLACQVSAQPAAQATWSKDGAPLESSSRVLISATLKNFQLLT 1219 

P FLR L D V L + L CQV+ P +W +G ++SS ++ ++ L 
Sbjct: 207 PDFLRPLQDLEVGLAKEAMLECQVTGLPYPTISWFHNGHRIQSSDDRRMT-QYRDVHRLV 265 

Query: 1220 ILVWAEDLGVYTCSVSNALGTVTTTGVLRKAERPSSSP- -CPDIGEVYADGVLLVWKPV 1277 

V + GVY ++N LG L + PP+V VLWP 

Sbjct: 266 FPAVGPQHAGVYKSVIANKLGKAACYAHLYVTDWPGPPDGAPQWAVTGRMVTLTWNPP 325 

Query: 1278 ESY GPVTYIVQCSLEGG-SWTTLASDIFDCCYLTSKLSRGGTYTFRTACVSKAG 1330 

S +TY VQ+G WTL+++++L+G+FR + 

Sbjct: 326 RSLDMAIDPDSLTYTVQHQVLGSDQWTALVTGLREPGWAATGLRKGVQHIFRVLSTTVKS 385 

Query: 1331 MGPYSSPSE--QVLLGGPS 134 7 

S PSE Q+L GP+ 
Sbjct: 386 SSKPSPPSEPVQLLEHGPT 404 (SEQ ID NO: 10) 



Score =45.7 bits (106), Expect = 0,004 

Identities = 30/102 (29%) , Positives = 45/102 (43%) , Gaps = 1/102 (0%) 

Query: 115 9 APTFLRELSDETWLGQSVTLACQVSAQPAAQATWSKDGAPLESSSRVLISATLKNFQLL 1218 

AP F R L D V+ G++ C++S P TW+ G P+E S + + L 
Sbjct: 82 APLFTRLLEDVEVLEGRAARFDCKISGTPPPWTWTHFGCPMEESENLRLRQD-GGLHSL 140 

Query: 1219 TILVWAEDLGVYTCSVSNALGTVTTTGVLRKAERPSSSPCP 1260 

I V +ED G+Y S N G + L E +++ P 
Sbjct: 141 HIAHVGSEDEGLYAVSAVNTHGQAHCSAQLYVEEPRTAASGP 182 (SEQ ID N0:11) 



Score =43.8 bits (101), Expect = 0.015 

Identities = 58/217 (26%), Positives = 84/217 (37%), Gaps = 23/217 (10%) 

Query: 619 RVLEEEAAREEQATLLAKAPSFETALRLPASGTHLAPGHSHSLEHDSPSTPRPSSEACGE 678 

R ++ +A A A S R P S T LAP + + T PSS 

Sbjct: 1788 RGTQDSSAVPSAAHQEAPVTSRPARARPPDSPTSLAPPIiAPAAPTPPSVTVSPSSPPTPP 1847 

Query: 679 AQRLPSAPSGGAPIRDMGHPQGSKQLPSTGGHPGTAQPERPSPDSPWGQPAPFCHPKQ6S 738 

+QLS +GP+ P+ +L+ A+P PS +P PF 
Sbjct: 1848 SQALSSLKAVGPPPQTP--PRRHRGLQAAR PAEPTLPSTHVTPSEPKPFVLD 1897 

Query: 739 APQEGCSPHPAVAPCPPGSFPPGSCKEAPLVPSSPFLGQPQAPPAPAKASPPIiDSKMGPG 798 

+ P A P G P S P+ + F+ P AP PA PP +K+ 
Sbjct: 1898 TGTPIPASTPQGVKPVSS--STPVYVVTSFVSAPPAPEPPAPEPPPEPTKVTVQ 1949 

Query: 799 DISLPGRPKPGPCSSPGSASQAS-SSQVSSLRVGSSQ 834 

+S P SSPGS+ ++S + ++LR G Q 

Sbjct: 1950 SLS PAKEWSSPGSSPRSSPRPEGTTLRQGPPQ 1982 (SEQ ID N0:12) 



Score =43.0 bits (99), Expect = 0.026 

Identities = 25/92 (27%), Positives = 44/92 (47%), Gaps = 4/92 (4%) 



FIGURE 2F 



Docket No,: CL000927.CIP-DIV2 
Serial No.: To be assigned 
Inventors: Ming-Hul WEI et al. 
Title: ISOLATED HUMAN KINASE PROTEINS... 

Query: 54 PSMQVTIEDVQAQTGGTAQFEAIIEGDPQPSVTWYKDS--VQLVDSTRLSQQQEGTTYSL 111 

P ++D++ A E + G P P+++W+ + +Q D R++Q ++ + L 

SbjCt: 207 PDFLRPLQDLEVGIiAKEAMLECQVTGLPYPTISWFHNGHRIQSSDDRRMTQyRD--VHRL 264 

Query: 112 VLRHVASKDAGVYTCLAQNTGGQVLCKAELLV 143 

V V + AGVY + N G+ C A L V 
Sbjct: 265 VFPAVGPQHAGVYKSVIANKLGKAACYAHLYV 296 (SEQ ID NO: 13) 



>gi| 8928460| sp|075962|TRIO_HUMAN TRIPLE FUNCTIONAL DOMAIN PROTEIN 
(PTPRF INTERACTING PROTEIN) >gi | 3644048 | gb | AAC43042 . 1 ) 
(AF091395) Trio isofoirm [Homo sapiens] 
Length = 3038 

Score =s 229 bits (579) , Expect = le-58 

Identities = 143/418 (34%), Positives = 215/418 (51%), Gaps = 11/418 (2%) 

Query: 53 PPSMQVTIEDVQAQTGGTAQFEAIIEGDPQPSVTWYKDSVQLVDST RLSQQQEGTTY 109 

PP + + +V +TG T + G P+ S+TW +++ +S G 

Sbjct: 2625 PPEFVIPLSEVTCETGETWLRCRVCGRPKASITWKGPEHNTLNNDGHYSISYSDLGEA- 2683 

Query: 110 SLVLRHVASKDAGVYTCLAQNTGGQVLCKAELLVLGGDNEPDSEKQSHRRKLHSFYEVKE 169 

+L + V ++D G+YTC+A N G A L VLG D + + SFY 

Sbjct: 2684 TLKIVGVTTEDDGIYTCIAVNDMGSASSSASLRVLGPGM--DGIMVTWKDNFDSFYSEVA 2741 

Query: 170 EIGRGVFGFVKRVQHKGNKILCAAKFIPLRSRTRAQAYRERDILAALSHPLVTGLLDQFE 229 

E+GRG F VK+ KG K A KF+ + R Q E IL +L HPL+ GLLD FE 
Sbjct: 2742 ELGRGRFSVVKKCDQKGTKRAVATKFVNKKLMKRDQVTHELGILQSLQHPLLVGLLDTFE 2801 

Query: 230 TRKTLILILELCSSEELLDRLYRKGWTEAEVKVYIQQLVEGLHYLHSHGVLHLDIKPSN 289 

T + IL+LE+ LLD + R G +TE +++ ++ +++E + YLH+ + HLD+KP N 

Sbjct: 2802 TPTSYILVLEMADQGRLLDCWRWGSLTEGKIRAHLGEVLEAVRYLHNCRIAHLDLKPEN 2861 

Query: 290 ILMVHP-AREDIKICDFGFAQNITPAELQFSQYGSPEFVSPEIIQQNPVSEASDIWAMGV 348 

IL+ A+ IK+ DFG A + G+PEF +PEII NPVS SD W++GV 

Sbjct: 2862 ILVDESLAKPTIKLADFGDAVQLNTTYYIHQLLGNPEFAAPEIILGNPVSLTSDTWSVGV 2921 

Query: 349 ISYLSLTCSSPFAGESDRATLLNVLEGRVSWSSPMAAHLSEDAKDFIKATLQRAPQARPS 408 

■i+Y+ L+ SPF +S T LN+ S+ +S+ AK+F+ LQ P RPS 

Sbjct: 2922 LTYVLLSGVSPFLDDSVEETCLNICRLDFSFPDDYPKGVSQKAKEFVCFLLQEDPAKRPS 2981 

Query: 409 T^QCLSHPWFLKSMPAEEAHFINTKQLKFLLARSRWQ RSLMSYKSILVMRSIPEL 463 

AA L W L++ ++T +L +R+Q R+SK+L R+P+ 

Sbjct: 2982 AALALQEQW-LQAGNGRST6VLDTSRLTSFIERRKHQNDVRPIRSIKNFLQSRLLPRV 3038 
(SEQ ID N0:14) 

Score = 121 bits (300) , Expect =: 7e-26 

Identities = 82/280 (29%), Positives « 137/280 (48%), Gaps = 10/280 (3%) 

Query: 1374 QIQRGRFSWRQCWEKASGRALAAKIIPYHPKDKTAVLREYEALKGLRHPHLAQLHAAYL 1433 

++ RGRFSW++C +K + RA+A K + + V E L+ L+HP L L + 

Sbjct: 2742 ELGRGRFSWKKCDQKGTKRAVATKFTOKKLMKRDQVTHELGILQSLQHPLLVGLLDTFE 2801 

Query: 1434 SPRHLVLILELCSGPELLPCLAERASYSESEVKDYLWQMLSATQYLHNQHILHLDLRSEN 1493 

+P +L+LE+ LL C+ S +E +++ +L ++L A +YLHN I HIjDL+ EN 

Sbjct: 2802 TPTSYILVLEMADQGRLLDCWRWGSLTEGKIRAHLGEVLEAVRYLHNCRIAHLDLKPEN 2861 

Query: 1494 MUTE YNLLKWDLGNAQSLSQEKVLPSDKFKDYLETMAPELLEGQGAVPQTDIWAI 1550 

+++ E +K+ D G+A L+ + + E APE++ G +D W++ 

Sbjct: 2862 ILVDESIiAKPTIKIiADFGDAVQLNTTYYI--HQLIjGNPEFAAPEIIIjGNPVSLTSDTWSV 2919 

Query: 1551 GVTAFIMLSAEYPVSSEGARDLQRGL-RKGLVRLSRCYAGLSGGAVAFLRSTLCAQPWGR 1609 

GV +++LS p + + + R + G+S A F+ h PR 

Sbjct: 2920 GVLTYVLLSGVSPFLDDSVEETCLNICRLDFSFPDDYFKGVSQKAKEFVCFLLQEDPAKR 2979 

(Juery: 1610 PCASSCLQCPWLTEEGPACSRPAPVTFPTARLRVFVRNRE 1649 

P A+ LQ WL A + + T+RL P+ R+ 

Sbjct: 2980 PSAALALQEQWL QAGNGRSTGVLDTSRLTSFIERRK 3015 (SEQ ID NO: 15) 



FIGURE 2G 



Dock t No.: CL000927-CIP-DIV2 
Serial No.: To b assigned 
Inventors: MIng-Hul WEI et al. 
Title: ISOLATED HUMAN KINASE PROTEINS... 

Score = 55.4 bits (131), Expect = 5e-06 

Identities = 42/153 (27%) , Positives = 70/153 (45%) , Gaps = 17/153 (11%) 

Query: 1128 GRPEGLEKEGPPRKKPGLASFRLSGLKS WDRAPTFLRELSDETWLGQSVTLACQV 1183 

G+ EG + G + + GL++ li + +D P F+ LS+ T G++V L C+V 

Sbjct: 2590 GKREGKLENGYRKSREGLSNKVSVKLLNPNYIYDVPPEFVIPLSEVTCETGETVVLRCRV 2649 

Query: 1184 SAQPAAQATW-SKDGAPLESSSRVLISATLKNFQLLTILVWAEDLGVYTCSVSNALGTV 1242 

+P A TW + L + IS + L 1+ V ED G+YTC N +G+ 

Sbjct: 2650 CGRPKASITWKGPEHNTLNNDGHYSISYSDLGEATLKIVGVTTEDIX3IYTCIAVNDMGS 2709 

Query: 1243 TTTGVLRKAERPSSSPCPDIGEVYADGVLLVWK 1275 

+++ LR + DG+++ WK 

Sbjct: 2710 SSSASLR VLGPGMDGIMVTWK 2730 (SEQ ID NO: 16) 



Score =39.1 bits (89), Expect =0.39 

Identities = 61/208 (29%), Positives = 76/208 (36%), Gaps = 65/208 (31%) 

Query: 688 GGAPIRDMGHPQGSKQLPSTGGHPGTA QPERPSPD S 723 

GGAP GH G S GG P T+ QP R P S 

Sbjct: 2252 GGAPSG6SGHSGGPS---SCGGAPSTSRSRPSRIPQPVRHHPPVLVSSAASSQAEADKMS 2308 

Query: 724 PWGQPAPFCHPKQGSAPQEGCSPHPAVAPCPPGSFPPGSCKEAPLVPSSPFLGQPQ 779 

P P P G+AP+ G S A + PPG+ GS +EA +P L P+ 
Sbjct: 2309 GTSTPGPSL-PPPGAAPEAGPS---APSRRPPGADAEGSEREAEPIPKMKVLESPRKGAA 2364 

Query: 780 APPAPAK ASPPLDSKMGPGDISLPGRPKPGPCSSPGSA 817 

+P APAK A+ PL+S + SL P P PS 
Sbjct: 2365 NASGSSPDAPAKDARASLGTLPLGKPRAGAASPLNSPLSSAVPSLGKEPFP PSSP 2419 

Query: 818 SQASSSQVSSLRVG-SSQVG--TEPGPS 842 

Q S SS+ +S+ G T PG S 
Sbjct: 2420 LQKGGSFWSSIPASPASRPGSFTFPGDS 2447 (SEQ ID N0:17) 

>gii302408l|sp|Q15746|KMLS__HUMAN MYOSIN LIGHT CHAIN KINASE, SMOOTH 

MUSCLE AND NON-MUSCLE ISOZYMES (MLCK) [CONTAINS: TELOKIN] 
Length = 1913 

Score = 206 bits (518) , Expect = 2e-51 

Identities = 104/298 (34%), Positives = 173/298 (57%), Gaps = 2/298 (0%) 

Query: 159 RKLHSFYEVKEEIGRGVFGFVKRVQHKGNKILCAAKFIPLRSRTRAQAYRER-DILAALS 217 

+K+ FY+++E +G G FG V R+ K + + A KF S + R+ 1+ L 
Sbjct: 1458 QKVSDFYDIEERLGSGKFGQVFRLVEKKTRKVWAGKFFKAYSAKEKENIRQEISIMNCLH 1517 

Query: 218 HPLVTGLLDQFETRKTLILILELCSSEELLDRLYRKGV-VTEAEVKVYIQQLVEGLHYLH 276 

HP + +D FE + ++++LE+ S EL +R+ + +TE E Y++Q+ EG+ Y+H 
Sbjct: 1518 HPKLVQCVDAFEEKANIVMVLEIVSGGELFERIIDEDFELTERECIKYMRQISEGVEYIH 1577 

Query: 277 SHGVLHLDIKPSNILMVHPAREDIKICDFGFAQNITPAELQFSQYGSPEFVSPEIIQQNP 336 

G++HLD+KP NI+ V+ IK+ DFG A+ + A +G+PEFV+PE+I P 

Sbjct: 1578 KQGIVHLDLKPENIMCVNKTGTRIKLIDFGLARRLENAGSLKVLFGTPEFVAPEVINYEP 1637 

Query: 337 VSEASDIWAMGVISYLSLTCSSPFAGESDRATLLNVLEGRVSWSSPMAAHLSEDAKDFIK 396 

+S A+D+W++GVI Y+ ++ SPF G++D TL NV + +S+DAKDFI 

Sbjct: 1638 ISYATDMWSIGVICYILVSGLSPFMGDNDNETLANVTSATWDFDDEAFDEISDDAKDFIS 1697 

Query: 397 ATLQRAPQARPSAAQCLSHPWFLKSMPAEEAHFINTKQLKFLLARSRWQRSLMSYKSI 454 

L++ + R QCL HPW +K EA ++ ++K +AR +WQ++ + ++I 

Sbjct: 1698 NLLKKDMKNRLDCTQCLQHPWLMKDTKNMEAKKLSKDRMKKYMARRKWQKTGNAVRAI 1755 
(SEQ ID N0:18) 

Score = 127 bits (315), Expect = le-27 

Identities = 134/528 (25%), Positives = 219/528 (41%), Gaps = 55/528 (10%) 

Query: 1132 GLEKEGPPRKKPGLASFRLSGLKSWDRAPTFLRELSDETWLGQSVTLACQVSAQPAAQA 1191 

G E + +KKP + + + P ++ D+ V G+SV L +V+ 

Sbjct: 1215 GTESDATVKKKPAPKTPPKAAMP PQIIQFPEDQKVRAGESVELFGKVTGTQPITC 1269 



FIGURE 2H 



Dock tNo.: CL000927-CIP-DIV2 
S rial No.: To be assigned 
Inventors: Ming-Hul WEt et al. 
Title: ISOLATED HUMAN KINASE PROTEINS... 

Query: 1192 TWSKDGAPLESSSRVLISATLKNFQLLTILVWAEDLGVYTCSVSNALGT VTTTGV 124 7 

TW K ++ S + + + +N LTIL E G YT V N IiG+ V T V 
Sbjct: 1270 TWMKFRKQIQDSEHIKVENS-ENGSKLTILAARQEHCGCYTLLVENKLGSRQAQVNLT-V 1327 

Query: 1248 LRKAERPSSSPCPDIGEVYADGVLLVWKPVESYGPVTYIVQCSLE GGSWTTLASD 1302 

+ K + P+ +PC ++ + + L W SY + + S+E +W LA+ 

Sbjct: 1328 VDKPDPPAGTPCAS--DIRSSSLTLSWYG-SSYDGGSAVQSYSIEIWDSANKTWKELAT- 1383 

Query: 1303 IFDCCYLTS KLSRGGTYTFRTACVSKAGMGPYSSPSEQVLLGGPSHLAS 1351 

C TS L Y FR ++ G S SE +G 

Sbjct: 1384 CRSTSFNVQDLLPDHEYKFRVRAINVYGTSEPSQESELTTVGEKPEEPKMKWRCQT 1439 

Query: 1352 EEESQGRSAQPLPSTKTFAF QTQIQRGRFSWRQCWEKASGRALAAKIIP-YH 1403 

E E R+ K F + ++ G+F V+EK + +AK Y 

Sbjct: 1440 DDEKEPEVDYRTVTINTEQKVSDFYDIEERLGSGKFGQVFRLVEKKTRKVWAGKFFKAYS 1499 

Query: 1404 PKDKTAVLREYEALKGLRHPHLAQLHAAYLSPRHLVLILELCSGPELLP-CLAERASYSE 1462 

K+K + +E + L HP L Q A+ ++V++LE+ SG EL + E +E 
Sbjct: 1500 AKEKENIRQEISIMNCLHHPKLVQCVDAFEEKANIVMVLEIVSGGELFERIIDEDFELTE 1559 

Query: 1463 SEVKDYLWQMLSATQYLHNQHILHLDLRSENMIITEY- -NLLKWDLGNAQSLSQE K 1517 

E Y+ Q+ +Y+H Q I+HLDL+ EN++ +K++D G A+ L K 

Sbjct: 1560 RECIKYMRQISEGVEYIHKQGIVHLDLKPENIMCVNKTGTRIKLIDFGLARRLENAGSLK 1619 

Query: 1518 VLPSDKFKDYLETMAPELLEGQGAVPQTDIWAIGVTAFIMLSAEYPVSSEGARDLQRGLR 1577 

VL E +APE++ + TD+W+IGV +I++S p + + + 

Sbjct: 1620 VLFGTP EFVAPEVINYEPISYATDMWSIGVICYILVSGLSPFMGDNDNETLANVT 1674 

Query: 1578 KGLVRL-SRCYAGLSGGAVAFLRSTLCAQPWGRPCASSCLQCPWLTEE 1624 

+ +S A F+ + L R + CLQ PWL ++ 

Sbjct: 1675 SATWDFDDEAPDEISDDAKDFISNLLKKDMKNRLDCTQCLQHPWLMKD 1722 (SEQ ID N0:19) 

Score =s 64.4 bits (154), Expect = 9e-09 

Identities = 36/106 (33%), Positives = 52/106 (48%), Gaps = 4/106 (3%) 

Query: 54 PSMQVTIEDVQAQTGGTAQFEAIIEGDPQPSVTWYKDSVQLVDSTRLS-QQQEGTTYSLV 112 

P TI D++ G A+F+ lEG P P V W+KD + +S E SL+ 

Sbjct: 1808 PYFSKTIRDLEWEGSAARFDCKIEGYPDPEWWFKDDQSIRESRHFQIDYDEDGNCSLI 1867 

Query: 113 LRHVASKDAGVYTCLAQNTGGQVLCKAELLV LGGDNEPDSEKQ 155 

+ V D YTC A N+ G+ C AEL+V G+ E + E++ 

Sbjct: 1868 ISDVCGDDDAKYTCKAVNSLGEATCTAELIVETMEEGEGEGEEEEE 1913 (SEQ ID NO:20) 

Score = 64.0 bits (153), Expect = le-08 
Identities = 35/96 (36%), Positives = 46/96 (47%) 

Query: 53 PPSMQVTIEDVQAQTGGTAQFEAIIEGDPQPSVTWYKDSVQLVDSTRLSQQQEGTTYSLV 112 

PP + V + G +F I G PQP VTW K +V L S R+S ++ L 

Sbjct: 160 PPKFATKLGRVWKEGQMGRFSCKITGRPQPQVTWLKGNVPLQPSARVSVSEKNGMQVLE 219 

Query: 113 LRHVASKDAGVYTCLAQNTGGQVLCKAELLVLGGDN 148 

+ V D GVYTCL N G+ AEL + G D+ 
Sbjct: 220 IHGVNQDDVGVYTCLWNGSGKMMSAELSIQGLDS 255 (SEQ ID NO: 21) 

Score = 59.3 bits (141), Expect = 3e-07 

Identities « 30/100 (30%), Positives = 50/100 (50%), Gaps = 3/100 (3%) 

Query: 47 LPALPGPPSMQVTIE DVQAQTGGTAQFEAIIEGDPQPSVTWYKDSVQLVDSTRLSQQ 103 

LP P P+ + ++ D++ G + G+P P V W + ++ +S + 

Sbjct: 613 LPVAPSKPTAPIPLQGLSDLKVMDGSQVTMTVQVSGNPPPEVIWLHNGNEIQESEDFHFE 672 

Query: 104 QEGTTYSLVLRHVASKDAGVYTCLAQNTGGQVLCKAELLV 143 

Q GT +SL ++ V +D G YTC A N+ G+V +A L V 
Sbjct: 673 QRGTQHSLWIQEVFPEDTGTYTCEAWNSAGEVRTQAVLTV 712 (SEQ ID NO: 22) 



FIGURE 21 
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Score = 57.4 bits (136), Expect = le-06 

Identities = 32/89 (35%), Positives = 46/89 (50%), Gaps = 1/89 (1%) 

Query: 1160 PTFLRELSDETVVLGQSVTLACQVSAQPAAQATWSKDGAPLESSSRVLISATLKNFQLLT 1219 

P F +L V GQ +C+++ +P Q TW K PL+ S+RV +S Q+L 
Sbjct: 161 PKFATKLGRWVKEGQMGRFSCKITGRPQPQVTWLKGNVPLQPSARVSVSEK-NGMQVLE 219 

Query: 1220 ILVWAEDLGVYTCSVSNALGTVTTTGVL 1248 

I V +D+GVYTC V N G + + L 
Sbjct: 220 IHGVNQDDVGVYTCLWNGSGKASMSAEL 248 (SEQ ID NO:23) 



Score =53.5 bits (126), Expect = 2e-05 

Identities = 32/98 (32%) , Positives = 46/98 (46%) , Gaps = 4/98 (4%) 

Query: 1159 APTFLRELSDETWLGQSVTLACQVSAQPAAQATWSKDGAPLESSSRVLISATLKNFQLL 1218 

AP+F L D V+ GQ L C V P + TW +G P++ + + + L 
Sbjct: 513 APSFSSVLKDCAVIEGQDFVLQCSVRGTPVPRITWLLNGQPIQYARSTCEAGVAE L 568 

Query: 1219 TILVWAEDLGVYTCSVSNALGTVTTTGVLRKAERPSS 1256 

I + ED G YTC NALG V+ + + E+ SS 
Sbjct: 569 HIQDALPEDHGTYTCLAENALGQVSCSAWVTVHEKKSS 606 (SEQ ID N0:24) 



Score = 53.1 bits (125), Expect = 2e-05 

Identities = 37/113 (32%), Positives = 48/113 (41%), Gaps = 1/113 (0%) 

Query: 114 0 RKKPGLASFRLSGLKSWDRAPTFLRELSDETWLGQSVTLACQVSAQPAAQATWSKDGAP 1199 

+K + + L S AP FL+ LSD V+ G VT+ QVS P + W +G 
Sbjct: 603 KKSSRKSEYLLPVAPSKPTAPIFLQGLSDLKVMDGSQVTMTVQVSGNPPPEVIWLHNGNE 662 

Query: 1200 LESSSRVLISATLKNFQLLTILVWAEDLGVYTCSVSNALGTVTTTGVLRKAE 1252 

++ S L I V ED G YTC N+ G V T VL E 

Sbjct: 663 IQESEDFHFEQRGTQHS-LWIQEVFPEDTGTYTCEAWNSAGEVRTQAVLTVQE 714 (SEQ ID NO:25) 



Score =51.9 bits (122), Expect = Se-OS 

Identities = 34/101 (33%), Positives = 50/101 (48%), Gaps = 2/101 (1%) 

Query: 46 SLPALPGPPSMQVTIEDVQAQTGGTAQFEAIIEGDPQPSVTWYKDSVQLVDSTR-LSQQQ 104 

S+P L P+ + ++ + G TA+FE + G P+P VTW+++ + R L 
Sbjct: 26 SMP-LTEAPAFILPPRNLCIKEGATAKFEGRVRGYPEPQVTWHRNGQPITSGGRFLLDCG 84 

Query: 105 EGTTYSLVLRHVASKDAGVYTCLAQNTGGQVLCKAELLVLG 145 

T+SLV+ V +D G YTC A N G EL V G 

Sbjct: 85 IRGTFSLVIHAVHEEDRGKYTCEATNGSGARQVTVELTVEG 125 (SEQ ID NO:26) 



Score =50.8 bits (119), Expect = le-04 

Identities = 41/182 (22%), Positives = 65/182 (35%), Gaps = 26/182 (14%) 

Query: 1130 PEGLEKEGPPRKKPGLASFRLSGLKSWDRA PTFLRELSDETV 1171 

P G E++ P+P RGLSD PF++V 

Sbjct: 366 PSGEERKRPAPPRPATFPTRQPGLGSQDWSKAANRRIPMEGQRDSAFPKFESKPQSQEV 425 

Query: 1172 VLGQSVTLACQVSAQPAAQATWSKDGAPLESSSRVLISATLKNFQLLTILVWAEDLGVY 1231 

Q+V C+VS P + W +G P+ + h +h D G Y 

Sbjct: 426 KENQTVKFRCEVSGIPKPEVAWFLEGTPVRRQEGSIEVYEDAGSHYLCLLKARTRDSGTY 485 

Query: 1232 TCSVSNALGTVTTTGVLRKAERPSSSPCPDIGEVYADGVLLVWKPVESYGPVTYIVQCSL 1291 

+C+ SNA G V+ + L+ P V D ++ + +++QCS+ 
Sbjct: 486 SCTASNAQGQVSCSWTLQVERLAVMEVAPSFSSVLKDCAVIEGQ DFVLQCSV 537 

Query: 1292 EG 12 93 
G 

Sbjct: 538 RG 539 (SEQ ID NO: 27) 



Score = 50.4 bits (118), Expect = 2e-04 

Identities = 26/100 (26%), Positives = 47/100 (47%), Gaps = 3/100 (3%) 



FIGURE 2J 
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Query: 54 PSMQVTIEDVQAQTGGTAQFEAIIEGDPQPSVTWYKDSVQLVDSTRLSQQQEGTTYSLVL 113 

P+ + ++DV G ++DP++W + L + + QEG+ S+ + 

Sbjct: 1098 PAFKQKLQDVHVAEGKKLLLQCQVSSDPPATIIWTLNGKTLKTTKFIILSQEGSLCSVSI 1157 

Query: 114 RHVASBCDAGVYTCLAQNTGGQVLCKAELLVLGGDNEPDSE 153 

+D G+Y C+A+N GQ C ++ V D+ P SE 
Sbjct: 1158 EKALLEDRGLYKCVAKNDAGQAECSCQVTV---DDAPASE 1194 (SEQ ID NO: 28) 



Score = 50.0 bits (117), Expect « 2e-04 

Identities = 35/125 (28%), Positives = 59/125 (47%), Gaps = 16/125 (12%) 

Query: 1154 KSWDIWIPTFLRELSDETVVLGQSVTLACQVSAQPAAQATWSKDGAPLESSSRVLISATLK 1213 

+S AP F ++L D V G+ + L CQVS+ PA W+ +G L+++ +++S 
Sbjct: 1092 ESQGTAPAFKQKLQDVHVAEGKKLLLQCQVSSDPPATIIWTLNGKTLKTTKFIILSQE-G 1150 

(^lery: 1214 NFQLLTILVWAEDLGVYTC SVSNALGTVTTTGVLRKAERPSSSP 1258 

+ ++I + ED G+Y C +V +A + T K+ RP SS 

Sbjct: 1151 SLCSVSIEKALLEDRGLYKCVAKNDAGQAECSCQVTVDDAPASENTKAPEMKSRRPKSSL 1210 

Query: 1259 CPDIG 1263 
P +G 

Sbjct: 1211 PPVLG 1215 (SEQ ID NO:29) 



Score = 48.0 bits (112), Expect = 8e-04 
Identities = 26/87 (29%) , Positives = 38/87 (42%) 

Query: 1159 APTFLRELSDETWLGQSVTLACQVSAQPAAQATWSKDGAPLESSSRVLISATLKNFQLL 1218 

AP F+ + + G + +V P Q TW ++G P+ S R L+ ++ L 

Sbjct: 32 APAFILPPRNLCIKEGATAKFEGRVRGYPEPQVTWHRNGQPITSGGRFLLDCGIRGTFSL 91 

Query: 1219 TILVWAEDLGVYTCSVSNALGTVTTT 1245 

I V ED G YTC +N G T 
Sbjct: 92 VIHAVHEEDRGKYTCEATNGSGARQVT 118 (SEQ ID NO: 30) 



Score = 45.3 bits (105), Expect = 0.005 

Identities = 37/140 (26%), Positives s 54/140 (38%), Gaps ^ 23/140 (16%) 

Query: 22 TNHPSMVGCGWHPGLCGWGGGLHSSLPALPGPPSMQVTIEDVQAQTGGTAQFEAIIEGDP 81 

+N V C W + L + PS ++D G + + G P 

Sbjct: 490 SNAQGQVSCSWTLQV ERLAVMEVAPSFSSVLKDCAVIEGQDFVLQCSVRGTP 541 

Query: 82 QPSVTWYKDS--VQI1VDSTRLSQQQEGTTYSLVLRHVASKDAGVYTCLAQNTGGQVLCKA 139 

P +TW + +Q ST E L ++ +D G YTCLA+N GQV C A 

Sbjct: 542 VPRITWLLNGQPIQYARSTC EAGVAELHIQDALPEDHGTYTCLAENALGQVSCSA 596 

Query: 140 ELLVLGGDNEPDSEKQSHRR 159 

+ V EK+S R+ 

Sbjct: 597 WVTV HEKKSSRK 608 (SEQ ID NO: 31) 



Score =44.5 bits (103), Expect = 0.009 

Identities = 26/104 (25%), Positives « 44/104 (42%), Gaps = 7/104 (6%) 

Query: 41 GGLHSSLPALPGPPSMQVTIEDVQAQTGGTAQFEAIIEGDPQPSVTWYKDSVQLV-DSTR 99 

GS+P PQ ++T+F+G P+P V W+ + + 

Sbjct: 407 GQRDSAFPKFESKPQSQ EVKENQTVKFRCEVSGIPKPEVAWFLEGTPVRRQEGS 460 

Query: 100 LSQQQEGTTYSLVLRHVASKDAGVYTCLAQNTGGQVLCKAELLV 143 

+ ++ ++ L L ++D+G Y+C A N GQV C L V 
Sbjct: 461 lEVYEDAGSHYLCLLKARTRDSGTYSCTASNAQGQVSCSWTLQV 504 (SEQ ID NO: 32) 



Score =44.1 bits (102), Expect = 0.012 

Identities = 26/82 (31%), Positives = 38/82 (45%), Gaps = 1/82 (1%) 
Query: 63 VQAQTGGTAQFEAIIEGDPQPSVTWYKDSVQLV-DSTRLSQQQEGTTYSLVLRHVASKDA 121 
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V A G + I GDP P+V W +D L D+ Q ++LVL+ V A 

Sbjct: 730 VTASLGQSVLISCAIAGDPFPTVHWLRDGKALCKDTGHFEVI^NEDVFTLVLKKVQPWH^ 789 

Query: 122 GVYTCLAQNTGGQVLCKAELLV 143 

G Y L +N G+ C+ L++ 
Sbjct: 790 GQYEILLKNRVGECSCQVSLML 811 (SEQ ID NO:33) 



Score = 43.8 bits (101), Expect = 0.015 
Identities = 26/89 (29%), Positives = 35/89 (39%) 



Query 1160 PTFLRELSDETVVLGQSVTLACQVSAQPAAQATWSKDGAPLESSSRVLISATLKNFQLLT 1219 

PF + + DWG+ C++P + WKD+S I L 
Sbjct: 1808 PYFSKTIRDLEWEGSAARFDCKIEGYPDPEWWFKDDQSIRESRHFQIDYDEDGNCSLI 1867 

Query: 1220 ILVWAEDLGVYTCSVSNALGTVTTTGVL 1248 

I V +D YTC N+LG T T L 
Sbjct: 1868 ISDVCGDDDAKYTCKAVNSLGEATCTAEL 1896 (SEQ ID N0:34) 



FIGURE 2L 
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1 CAGCACGAGG AACTCCTTCT GATCACCTGG CCAGCTGAGG TCAGAGTGGG 
51 A6AGGCAGTG GTTCCATTGA AGGAGTACTC CTAACTGTCA GAAGCCTGGG 
101 CGGTCAGGAT GGGGTGCTGT CGCTTGGGCT GCGGGGGGTG TTCAGTTGCC 
151 CACAGTGTAT CTCAGGGTCT CACCAACCAT CCAAGCATGG TAGGCTGTGG 
201 CTGGCACCCA GGGTTGTGTG GCTGGGGAGG TGGTCTCCAC AGTTCCCTCC 
251 CTGCCCTCCC AGGGCCCCCA TCCATGCAGG TAACCATCGA GGATGTGCAG 
301 GCACAGACAG GCGGAACGGC CCAATTCGAG GCTATCATTG AGGGCGACCC 
351 ACAGCCCTCG GTGACCTGGT ACAAGGACAG CGTCCAGCTG GTGGACAGCA 
401 CCCGGCTTAG CCAGCAGCAA 6AAGGCACCA CATACTCCCT GGTGCTGAGG 
451 CATGTGGCCT CGAAGGATGC CGGCGTTTAC ACCTGCCTGG CCCAAAACAC 
501 TGGTGGCCAG GTGCTCTGCA AGGCAGAGCT GCTGGTGCTT GGGGGGGACA 
551 ATGAGCCGGA CTCAGAGAAG CAAAGCCACC GGAGGAAGCT GCACTCCTTC 
601 TATGAGGTCA AGGAGGAGAT TGGAAGGGGC GTGTTTGGCT TCGTAAAAAG 
651 AGTGCAGCAC AAAGGAAACA AGATCTTGTG CGCTGCCAAG TTCATCCCCC 
701 TACGGAGCAG AACTCGGGCC CAGGCATACA GGGAGCGAGA CATCCTGGCC 
751 GCGCTGAGCC ACCCGCTGGT CACGGGGCTG CTGGACCAGT TTGAGACCCG 
801 CAAGACCCTC ATCCTCATCC TGGAGCTGTG CTCATCCGAG GAGCTGCTGG 
851 ACCGCCTGTA CAGGAAGGGC GTGGTGACGG AGGCCGAGGT CAAGGTCTAC 
901 ATCCAGCAGC TGGTGGAGGG GCTGCACTAC CTGCACAGCC ATGGCGTTCT 
951 CCACCTGGAC ATAAAGCCCT CTAACATCCT GATGGTGCAT CCTGCCCGGG 
1001 AAGACATTAA AATCTGCGAC TTTGGCTTTG CCCAGAACAT CACCCCAGCA 
1051 GAGCTGCAGT TCAGCCAGTA CGGCTCCCCT GAGTTCGTCT CCCCCGAGAT 
1101 CATCCAGCAG AACCCTGTGA GCGAAGCCTC CGACATTTGG GCCATGGGTG 
1151 TCATCTCCTA CCTCAGCCTG ACCTGCTCAT CCCCATTTGC CGGCGAGAGT 
1201 GACCGTGCCA CCCTCCTGAA CGTCCTGGAG GGGCGCGTGT CATGGAGCAG 
1251 CCCCATGGCT GCCCACCTCA GCGAAGACGC CAAAGACTTC ATCAAGGCTA 
1301 CGCTGCAGAG AGCCCCTCAG GCCCGGCCTA GTGCGGCCCA GTGCCTCTCC 
1351 CACCCCTGGT TCCTGAAATC CATGCCTGCG GAGGAGGCCC ACTTCATCAA 
1401 CACCAAGCAG CTCAAGTTCC TCCTGGCCCG AAGTCGCTGG CAGCGTTCCC 
1451 TGATGAGCTA CAAGTCCATC CTGGTGATGC GCTCCATCCC TGAGCTGCTG 
1501 CGGGGCCCAC CCGACAGCCC CTCCCTCGGC GTAGCCCGGC ACCTCTGCAG 
1551 GGACACTGGT GGCTCCTCCA GTTCCTCCTC CTCCTCTGAC AACGAGCTCG 
1601 CCCCATTTGC CCGGGCTAAG TCACTGCCAC CCTCCCCGGT GACACACTCA 
1651 CCACTGCTGC ACCCCCGGGG CTTCCTGCGG CCCTCGGCCA GCCTGCCTGA 
1701 GGAAGCCGAG GCCAGTGAGC GCTCCACCGA GGCCCCAGCT CCGCCTGCAT 
1751 CTCCCGAGGG TGCCGGGCCA CCGGCCGCCC AGGGCTGCGT GCCCCGGCAC 
1801 AGCGTCATCC GCAGCCTGTT CTACCACCAG GCGGGTGAGA GCCCTGAGCA 
1851 CGGGGCCCTG GCCCCGGGGA GCAGGCGGCA CCCGGCCCGG CGGCGGCACC 
1901 TGCTGAAGGG CGGCTACATT GCGGGGGCGC TGCCAGGCCT GCGCGAGCCA 
1951 CTGATG^AGC ACCGCGTGCT GGAGGAGGAG GCCGCCAGGG AGGAGCAGGC 
2001 CACCCTCCTG GCCAAAGCCC CCTCATTCGA GACTGCCCTC CGGCTGCCTG 
2 051 CCTCTGGCAC CCACTTGGCC CCTGGCCACA GCCACTCCCT GGAACATGAC 
2101 TCTCCGAGCA CCCCCCGCCC CTCCTCGGAG GCCTGCGGTG AGGCACAGCG 
2151 ACTGCCTTCA GCCCCCTCCG GGGGGGCCCC TATCAGGGAC ATGGGGCACC 
2201 CTCAGGGCTC CAAGCAGCTT CCATCCACTG GTGGCCACCC AGGCACTGCT 
2251 CAGCCAGAGA GGCCATCCCC GGACAGCCCT TGGGGGCAGC CAGCCCCTTT 
2301 CTGCCACCCC AAGCAGGGTT CTGCCCCCCA GGAGGGCTGC AGCCCCCACC 
2351 CAGCAGTTGC CCCATGCCCT CCTGGCTCCT TCCCTCCAGG ATCTTGCAAA 
2401 GAGGCCCCCT TAGTACCCTC AAGCCCCTTC TTGGGACAGC CCCAGGCACC 
2451 CCCTGCCCCT GCCAAAGCAA GCCCCCCATT GGACTCTAAG ATGGGGCCTG 
2501 GAGACATCTC TCTTCCTGGG AGGCCAAAAC CCGGCCCCTG CAGTTCCCCA 
2551 GGGTCAGCCT CCCAGGCGAG CTCTTCCCAA GTGAGCTCCC TCAGGGTGGG 
2601 CTCCTCCCAG GTGGGCACAG AGCCTGGCCC CTCCCTGGAT GCGGAGGGCT 
2651 GGACCCAGGA GGCTGAGGAT CTGTCCGACT CCACACCCAC CTTGCAGCGG 
2701 CCTCAGGAAC AGGTGACCAT GCGCAAGTTC TCCCTGGGTG GTCGCGGGGG 
2751 CTACGCAGGC GTGGCTGGCT ATGGCACCTT TGCCTTTGGT GGAGATGCAG 
2801 GGGGCATGCT GGGGCAGGGG CCCATGTGGG CCAGGATAGC CTGGGCTGTG 
2851 TCCCAGTCGG AGGAGGAGGA GCAGGAGGAG GCCAGGGCTG AGTCCCAGTC 
2901 GGAGGAGCAG CAGGAGGCCA GGGCTGAGAG CCCACTGCCC CAGGTCAGTG 
2951 CAAGGCCTGT GCCTGAGGTC GGCAGGGCTC CCACCAGGAG CTCTCCAGAG 
3001 CCCACCCCAT GGGAGGACAT CGGGCAGGTC TCCCTGGTGC AGATCCGGGA 
3051 CCTGTCAGGT GATGCGGAGG CGGCCGACAC AATATCCCTG GACATTTCCG 
3101 AGGTGGACCC CGCCTACCTC AACCTCTCAG ACCTGTACGA TATCAAGTAC 
3151 CTCCCATTCG AGTTTATGAT CTTCAGGAAA GTCCCCAAGT CCGCTCAGCC 
3201 AGAGCCGCCC TCCCCCATGG CTGAGGAGGA GCTGGCCGAG TTCCCGGAGC 
3251 CCACGTGGCC CTGGCCAGGT GAACTGGGCC CCCACGCAGG CCTGGAGATC 
3301 ACAGAGGAGT CAGAGGATGT GGACGCGCTG CTGGCAGAGG CTGCCGTGGG 
3351 CAGGAAGCGC AAGTGGTCCT CGCCGTCACG CAGCCTCTTC CACTTCCCTG 
3401 GGAGGCACCT GCCGCTGGAT GAGCCTGCAG AGCTGGGGCT GCGTGAGAGA 
3451 GTGAAGGCCT CCGTGGAGCA CATCTCCCGG ATCCTGAAGG GCAGGCCGGA 
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3501 AGGTCTGGAG AAGGAGGGGC CCCCCAGGAA GAAGCCAGGC CTTGCTTCCT 
3551 TCCGGCTCTC AGGTCTGAAG AGCTGGGACC GAGCGCCGAC ATTCCTAAGG 
3601 GAGCTCTCAG ATGAGACTGT GGTCCTGGGC CAGTCAGTGA CACTGGCCTG 
3651 CCAGGTGTCA GCCCAGCCAG CTGCCCAGGC CACCTGGAGC AAAGACGGAG 
3701 CCCCCCTGGA GAGCAGCAGC CGTGTCCTCA TCTCTGCCAC CCTCAAGAAC 
3751 TTCCAGCTTC TGACCATCCT GGTGGTGGTG GCTGAGGACC TGGGTGTGTA 
3801 CACCTGCAGC GTGAGCAATG CGCTGGGGAC AGTGACCACC ACGGGCGTCC 
3 851 TCCGGAAGGC AGAGCGCCCC TCATCTTCGC CATGCCCGGA TATCGGGGAG 
3901 GTGTACGCGG ATGGGGTGCT GCTGGTCTGG AAGCCCGTGG AATCCTACGG 
3951 CCCTGTGACC TACATTGTGC AGTGCAGCCT AGAAGGCGGC AGCTGGACCA 
4001 CACTGGCCTG CGACATCTTT GACTGCTGCT ACCTGACCAG CAAGCTCTCC 
4051 CGGGGTGGCA CCTACACCTT CCGCACGGCA TGTGTCAGCA AGGCAGGAAT 
4101 GGGTCCCTAC AGCAGCCCCT CGGAGCAAGT CCTCCTGGGA GGGCCCAGCC 
4151 ACCTGGCCTC TGAGGAGGAG AGCCAGGGGC GGTCAGCCCA ACCCCTGCCC 
4201 AGCACAAAGA CCTTCGCATT CCAGACACAG ATCCAGAGGG GCCGCTTCAG 
4251 CGTGGTGCGG CAATGCTGGG AGAAGGCCAG CGGGCGGGCG CTGGCCGCCA 
4301 AGATCATCCC CTACCACCCC AAGGACAAGA CAGCAGTGCT GCGCGAATAC 
4351 GAGGCCCTCA AGGGCCTGCG CCACCCGCAC CTGGCCCAGC TGCACGCAGC 
44 01 CTACCTCAGC CCCCGGCACC TGGTGCTCAT CTTGGAGCTG TGCTCTGGGC 
4451 CCGAGCTGCT CCCCTGCCTG GCCGAGAGGG CCTCCTACTC AGAATCTGAG 
4501 GTGAAGGACT ACCTGTGGCA GATGTTGAGT GCCACCCAGT ACCTGCACAA 
4551 CCAGCACATC CTGCACCTGG ACCTGAGGTC CGAGAACATG ATCATCACCG 
4601 AATACAACCT GCTCAAGGTC GTGGACCTGG GCAATGCACA GAGCCTCAGC 
4651 CAGGAGAAGG TGCTGCCCTC AGACAAGTTC AAGGACTACC TAGAGACCAT 
4701 GGCTCCAGAG CTCCTGGAGG GCCAGGGGGC TGTTCCACAG ACAGACATCT 
4751 GGGCCATCGG TGTGACAGCC TTCATCATGC TGAGCGCCGA GTACCCGGTG 
4801 AGCAGCGAGG GTGCACGCGA CCTGCAGAGA GGACTGCGCA AGGGGCTGGT 
4851 CCGGCTGAGC CGCTGCTACG CGGGGCTGTC CGGGGGCGCC GTGGCCTTCC 
4901 TGCGCAGCAC TCTGTGCGCC CAGCCCTGGG GCCGGCCCTG CGCGTCCAGC 
4951 TGCCTGCAGT GCCCGTGGCT AACAGAGGAG GGCCCGGCCT GTTCGCGGCC 
5001 CGCGCCCGTG ACCTTCCCTA CCGCGCGGCT GCGCGTCTTC GTGCGCAATC 
5051 GCGAGAAGAG ACGCGCGCTG CTGTACAAGA GGCACAACCT GGCCCAGGTG 
5101 CGCTGAGGGT CGCCCCGGCC ACACCCTTGG TCTCCCCGCT GGGGGTCGCT 
5151 GCAGACGCGC CAATAAAAAC GCACAGCCGG 6CGAGAAAAA AAAAAAAAAA 
5201 AAAAAAA (SEQ ID NO:3) 



FEATURES : 
Start: 109 
Exon: 109-5103 
Stop: 5104 



SNPs: 
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Context : 
DNA 

Position 

311 AACTCCTTCTGATCACCTGGCCAGCTGAGGTCAGAGTGGGAGAGGCAGTGGTTCCATTGA 
AGGAGTACTCCTAACTGTCAGAAGCCTGGGCGGTCAGGATGGGGTGCTGTCGCTTGGGCT 
GCGGGGGGTGTTCAGTTGCCCACAGTGTATCTCAGGGTCTCACCAACCATCCAAGCATGG 
TAGGCTGTGGCTGGCACCCAGGGTTGTGTGGCTGGGGAGGTGGTCTCCACAGTTCCCTCC 
CTGCCCTCCCAGGGCCCCCATCCATGCAGGTAACCATCGAGGATGTGCAGGCACAGACAG 
[T,C,G] 

CGGAACGGCCCAATTCGAGGCTATCATTGAGGGCGACCCACAGCCCTCGGTGACCTGGTA 
CAAGGACAGCGTCCAGCTGGTGGACAGCACCCGGCTTAGCCAGCAGCAAGAAGGCACCAC 
ATACTCCCTGGTGCTGAGGCATGTGGCCTCGAAGGATGCCGGCGTTTACACCTGCCTGGC 
CCAAAACACTGGTGGCCAGGTGCTCTGCAAGGCAGAGCTGCTGGTGCTTGGGGGGGACAA 
TGAGCCGGACTCAGAGAAGCAAAGCCACCGGAGGAAGCTGCACTCCTTCTATGAGGTCAA 
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1741 CAGC»TTCCCTGATGAGCTACAAGTCa^TCCTGGTGaTGa3CTCCATCCCT^ 
OSGGGCCCACCCGACAGCCCCTCCCTCXSGCGTAGCCaSGCACCTCrGC^^ 
GGCTCCTCCAGTTCCTCCrCCTCCTCTGACAACGAGCraSCCCCATTTGCCCGGGCTAAG 
TCA(n^CCACCCTCCCCGGTGACACyVCTCACCACTGCnX3CACCCCCGG^ 

CCCT(XK3CCAGCCTGCCTGAGGAAGCCGAGGCCAGT6AGCGCTCCACCGAGGCCCCAGCT 
[C,T1 

CGCCTGCATCTCCCGAGGGTGCCGGGCCACCGGCCGCCCAGGGCTGCGTGCCCCGGCACA 
GCX3TCATCCGCA6CCTGTTCTACCACCAGGC6G6TGAGA6CCCTGAGCACGGGGCCCTGG 
CCCCX3GGGAGCAGGCGGCACCCX3GCCCGGCGGCGGCACCTGCTGAAGGGCGGCTACATTG 
CGGGGGCGCTGCCA66CCTGCGCGAGCCACTGATGGAGCACCGCGTGCTGGAGGAGGAGG 
CCGCCAGGGAGGAGCAGGCCACCCTCCTGGCCAAAGCCCCCTCATTCGAGACTGCCCTCC 

2714 TACCCTCAAGCCCCTTCTTGGGACAGCCCCAGGCACCCCCTGCCCCTGCCAAAGCAAGCC 
CCCCATTGGACTCTAAGATGGGGCCTGGAGACATCTCTCTTCCTGGGAGGCCAAAACCCG 
GCCCCTGCAGTTCCCCAGGGTCAGCCTCCCAGGCGAGCTCTTCCCAAGTGAGCTCCCTCA 
GGGTGGGCTCCTCCCAGGTGGGCACAGAGCCTGGCCCCTCCCTGGATGCGGAGGGCTGGA 
CCCAGGAGGCTGAGGATCTGTCCGACTCCACACCCACCTTGCAGCGGCCTCAGGAACAGG 
tT,Cl 

GACCATGCGCAAGTTCTCCCTGGGTGGTCGCGGGGGCTACGCAGGCGTGGCTGGCTATGG 
CACCTTTGCCTTTGGTGGAGATGCAGGGGGCATGCTGGGGCAGGGGCCCATGTGGGCCAG 
GATAGCCTGGGCTGTGTCCCAGTCGGAGGAGGAGGAGCAGGAGGAGGCCAGGGCTGAGTC 
CCAGTCGGAGGAGCAGCAGGAGGCCAGGGCTGAGAGCCCACTGCCCCAGGTCAGTGCAAG 
GCCTGTGCCTGAGGTCGGCAGGGCTCCCACCAGGAGCTCTCCAGAGCCCACCCCATGGGA 

2745 GGCACCCCCTGCCCCTGCCAAAGCAAGCCCCCCATTGGACTCTAAGATGGGGCCTGGAGA 
CATCTCTCTTCCTGGGAGGCCAAAACCCGGCCCCTGCAGTTCCCCAGGGTCAGCCTCCCA 
GGCGAGCTCTTCCCAAGTGAGCTCCCTCAGGGTGGGCTCCTCCCAGGTGGGCACAGAGCC 
TGGCCCCTCCCTGGATGCGGAGGGCTGGACCCAGGAGGCTGAGGATCTGTCCGACTCCAC 
ACCCACCTTGCAGCGGCCTCAGGAACAGGTGACCATGCGCAAGTTCTCCCTGGGTGGTCG 
[C,T] 

GGGGGCTACGCAGGCGTGGCTGGCTATGGCACCTTTGCCTTTGGTGGAGATGCAGGGGGC 
ATGCTGGGGCAGGGGCCCATGTGGGCCAGGATAGCCTGGGCTGTGTCCCAGTCGGAGGAG 
GAGGAGCAGGAGGAGGCCAGGGCTGAGTCCCAGTCGGAGGAGCAGCAGGAGGCCAGGGCT 
GAGAGCCCACTGCCCCAGGTCAGTGOVAGGCCTGTGCCTGAGGTCGGCAGGGCTCCCACC 
AGGAGCTCTCCAGAGCCCACCCCATGGGAGGACATCGGGCAGGTCTCCCTGGTGCAGATC 

2859 CTCCCAGGCGAGCTCTTCCCAAGTGAGCTCCCTCAGGGTGGGCTCCTCCCAGGTGGGCAC 
AGAGCCTGGCCCCTCCCTGGATGCGGAGGGCTGGACCCAGGAGGCTGAGGATCTGTCCGA 
CTCCACACCCACCTTGCAGCGGCCTCAGGAACAGGTGACCATGCGCAAGTTCTCCCTGGG 
TGGTCGOSGGGGCTACGa^GGCXSTGGCIXKfCrATGGGy^CC^ 

AGGGGGCATGCTGGGGCAGGGGCCCATGTGGGCCAG6ATAGCCTGGGCTGTGTCCCAGTC 
tA,G] 

6AGGAGGAGGAGCAGGAGGAGGCCAGGGCTGAGTCCCAGTCGGAGGAGCAGCAGGAGGCC 
AGGGCTGAGAGCCCACTGCCCCAGGTCAGTGCAAGGCCTGTGCCTGAGGTCGGCAGGGCT 
CCCACCAGGAGCTCTCCAGAGCCCACCCCATGGGAGGACATCGGGCAGGTCTCCCTGGTG 
CAGATCCGGGACCT6TCAGGTGATGCGGAGGCGGCCGACACAATATCCCTGGACATTTCC 
GAGGTGGACCCCGCCTACCTCAACCTCTCAGACCTGTACGATATCAAGTACCTCCCATTC 

3420 CAACCTCTCAGACCTGTACGATATCAAGTACCTCCCATTCGAGTTTATGATCTTCAGGAA 
AGTCCCCAAGTCCGCTCAGCCAGAGCCGCCCTCCCCCATGGCTGAGGAGGAGCTGGCCGA 
6TTCCCGGAGCCCACGTGGCCCTGGCCAGGTGAACTGGGCCCCCACGCAGGCCTGGAGAT 
CTVCAGAGGAGTCAGAGGATGTGGACGCGCTGCTGGCAGAGGCTGCCGTGGGCAGGAAGCG 
CAAGTGGTCCTCXSCCGTCAasa^GCCTCTTCCACTTCCCTGGGAGGCACCTGCCGCTGGA 
[T,C] 

GAGCCTGCAGAGCTGGGGCTGCGTGAGAGAGTGAAGGCCTCCGTGGAGCACATCTCCCGG 

ATCCn^AAGGGCAGGCCGGAAGGTCTGGAGAAGGAGGGGCCCCCCAGGAAGAAGCCAGGC 

CTTGCTTCCTTCCGGCTCTCAGGTCTGAAGAGCroGGACCGAGCGCaSACATTCCT;^ 

GAGCTCTCAGATGAGACTGTGGTCCTGGGCaVGTCAGTGACACKMCCnX3CCAGGTGTC^ 

GCCCAGCCAGCTGCCCAGGCCACCri^AGCAAAGACGGAGCCCCCCTGGAGAGCAGCAG 

Chromosome map position: 1 
Bac accession number: AC023889 
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